Intonation Modelling for the Synthes

Authors

  • Jeska Buhmann
  • Jean-Pierre Martens
Abstract

Human readings of structured documents exhibit a much richer intonation than that observed in read isolated sentences. It is a challenge to capture this richness automatically using data-driven techniques. In this paper, we extend our previous research on intonation modelling for isolated sentences in several respects: (i) the RNN (Recurrent Neural Network) intonation model is now trained and evaluated on read documents, (ii) the model is evaluated as part of the overall prosody model, (iii) the feature selection process is completely automated, and (iv) the importance of text-level features such as text type, text structure and type-setting is investigated. It is demonstrated that acceptable intonation models can be constructed starting from a database that does not contain any explicit hand labelling of the intonation contours. It also appears that text type and text structure are important features whereas type-setting is not.

Similar resources

Optimized Selection of Intonation Dictionaries in Corpus Based Intonation Modelling

Data scarcity in corpus-based intonation modelling for TTS applications is addressed. We propose to apply a searching process to a list of dictionaries of classes of intonation patterns previously trained from a corpus, to avoid problems associated with the scarce number of samples in the classes. Results indicate an improvement over previous alternatives where the ...

F0 stylization and intonation modelling for Standard Yorùbá Text-to-speech application

This technical report documents an experiment on the stylization of the f0 curve on Standard Yorùbá (SY) syllables as well as a technique for intonation modelling. A number of interpolation polynomials were evaluated using root mean square error and mean opinion score techniques. The stylization experiment resulted in the selection of a third-degree polynomial for modelling the f0 curves on Yorùbá syll...

Modelling Japanese intonation using PENTAtrainer2

This paper presents results from Japanese intonation modelling using PENTAtrainer2, an articulatory synthesiser. Our first aim is to show that PENTA, on which PENTAtrainer2 is based, can achieve high accuracy in predictive synthesis of varying intonation contours. We trained the synthesiser on a 6251-sentence functionally annotated corpus and generated F0 contours for each communicative conditi...

Intonation modelling using a muscle model and perceptually weighted matching pursuit

We propose a physiologically based intonation model using perceptual relevance. Motivated by speech synthesis from a speech-to-speech translation (S2ST) point of view, we aim at a language independent way of modelling intonation. The model presented in this paper can be seen as a generalisation of the command response (CR) model, albeit with the same modelling power. It is an additive model whi...

Publication date: 2002